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1 "Detection of Protein Interactions" 

2 

3 Field of the Invention 
4 

5 The present invention relates to a method of 

6 detecting interactions. In particular, but not 

7 exclusively, the invention relates to a method of 

8 detecting protein to protein interactions using 

9 fluorescence. 
10 

11 Background to the Invention 
12 

13 Protein to protein interactions play a key role in 

14 many biological processes including the assembly of 

15 enzymes, protein homo /hetero- oligomers, regulation 

16 of intracellular transport, gene expression, 

17 receptor-ligand interactions, entry of pathogens 

18 into the cell and the action of small molecules or 

19 drugs . 
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1 identification and characterisation of 

2 macromolecular interactions can be performed using 

3 co-immunoprecipitation from cell lysates and 

4 solubilised membranes. However, this technique 

5 requires specific antibodies for both capture and 

6 identification of proteins and may further require 

7 the use of detergent to disrupt interactions. 
8 

9 More recently non-invasive techniques have been 

10 developed to determine protein to protein 

11 interactions . 
12 

13 Such non- invasive techniques were pioneered by the 

14 yeast two hybrid method which is based on 

15 complementation of a split yeast nuclear 

16 transcription factor. 
17 

18 The use of yeast expression systems to identify 

19 mammalian protein-to-protein interaction suffers 

20 from a number of disadvantages. Certain post- 
21 translational modifications, that are normally 

22 critical to mammalian protein interactions, cannot 

23 be achieved by expression and / or post 



24 



28 
29 
30 



translational modification of proteins by yeast 



25 cells. For example, tyrosine phosphorylation is the 

26 key to many m^ialiar cellular protein binding 

27 events involved in signal transduction. However, the 



yeast genome contains no tyrosine kinase genes so 
phosphotyrosine-dependent protein interactions 
cannot be accessed in yeast two hybrid studies . 
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3 

1 Furthermore, in yeast two hybrid screening the 

2 protein complex must be able to translocate to the 

3 nucleus to cause expression of the reporter gene or 

4 cause downstream events to trigger the expression of 

5 a reporter gene. Thus, proteins that are excluded 

6 from the yeast nucleus will not be accessible to 

7 this screening method. 
8 

9 Further methods such as protein complementation and 

10 the split ubiquitin method utilise similar 

11 underlying concepts to the yeast two hybrid method 

12 in that the interaction of two proteins (a bait 

13 protein and prey protein) act to express a reporter 

14 gene, the reporter gene allowing the interaction 

15 event to be visualised as a detectable signal. 
16 

17 Such methods which utilise the expression of a 

18 reporter gene such as an enzyme to produce a 

19 detectable signal suffer from the disadvantage that 

20 the location of the protein complexes being detected 

21 cannot be accurately visualised in the cell. 
22 

23 Recently the technique of fluorescence energy 

24 transfer (FRET) has been used to determine protein 

25 to protein interactions. In this technique the 

26 interaction of two f luorophores , an absorbing moiety 

27 and a fluoresceing moiety, indicates their close 

28 spatial proximity. For protein to protein 

29 interaction monitoring, the absorbing moiety is 

30 added to a first protein partner and the fluorescing 

31 moiety is added to a second binding partner. 

32 Provided the emission spectrum of the absorbing 
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4 

1 moiety overlaps the excitation spectrum of the 

2 fluorescing moiety and both moieties are within 100A 

3 of each other FRET will occur. 
4 

5 FRET can utilise mutations in the sequence of green 

6 fluorescent protein (GFP) from the jellyfish 

7 Aequorea victoria which have been shown to cause 

8 variations in the spectral emission of GFP. These 

9 mutations give rise to variants of GFP such as 

10 Yellow Fluorescent Protein (YFP) , as well as cyan 

11 . (CFP) and blue (BFP) fluorescing variants. This 

12 technique uses fluorescent energy transfer between 

13 these colour variants of GFP fused to interacting 

14 proteins. Unfortunately, this method requires 

15 overexpression of the GFP fusion proteins to allow 

16 quantification of the small changes in fluorescence. 

17 Related methods to FRET require the use of 

18 irreversible photobleaching (FRAP) or expensive 

19 instruments capable of measuring fluorescence 

20 lifetime imaging (FLIM) . 
21 

22 It has recently been shown that fluorescence can be 

23 generated following the functional association of 

24 two separate fragments of the GFP molecule (hapto- 

25 GFPs) when driven by the interaction of a pair of 

26 proteins fused via a VmYer to the new C and N' 

27 termini of the hapto-GFPs. (Ghosh et al, (2000); Hu 

28 et al, (2002) . 
29 

30 Whilst the methods disclosed by these documents may 

31 be used in determining whether interaction occurs 

32 between specific proteins they are not suitable for 
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1 screening the interactions of peptides of which the 

2 mode of binding is unknown. 
3 

4 Conventionally, the length of the linkers used is 

5 chosen from a knowledge the peptides whose 

6 interaction with each other is being tested. From 

7 this knowledge a suitable linker length which allows 

8 association of the fragments of fluorescent protein 

9 following the peptide interaction can be chosen. A 

10 knowledge of the peptides of interest or their mode 

11 of binding to each other has been considered to be 

12 required. 
13 

14 For example, if the peptides interact with each 

15 other such that they form an anti-parallel complex 

16 (hapto-GFP-l^-X^ -.binding to ;C 2 ->Itf 2 -hapto-GFP) and 

17 the fluorescent fragments are orientated such that 

18 they are directed away from each other in space then 

19 long linkers would be required to allow the 

20 fragments of fluorescent protein to interact. If 

21 short linkers were used, despite interaction of the 

22 peptides of interest occurring, then this might not 

23 be detected as the fragments would be prevented from 

24 associating with each other due to the 

25 stereochemical hindrance from the linkers . This 

oc wmxid result in a false negative result in an assay 

27 method. 

28 

29 

30 

31 

32 
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1 Summary of the Invention 
2 

3 The inventors through extensive work have developed 

4 a robust system which overcomes many of the problems 

5 of the prior art and provides for the first time a 

6 general screening method which may used to determine 

7 interaction between unknown peptides. 
8 

9 According to a first aspect of the invention there 

10 is provided a protein interaction system comprising 
11 

12 a plurality of bait fusion proteins, each 

13 fusion protein comprising (i) a first fragment 

14 of fluorescent protein, a first peptide of 

15 interest and a linker portion interposed 

16 between the first peptide and first fluorescent 

17 fragment; wherein the linker portions of each 

18 bait fusion protein are of different lengths, 

19 and the first peptide of interest of each bait 

20 fusion protein is identical to the first 

21 peptide of interest in each of the other bait 

22 fusion proteins, 
23 

24 and (ii) at least one prey fusion protein 

25 comprising_a. fragment of fluorescent protein 

26 complement-" rv t-o «-s^ <=irst fragment of 

27 fluorescent protein, a second peptide of 

28 interest and a second linker portion interposed 

29 between the complementary fragment and the 

30 second peptide; 
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1 wherein, on interaction of a first peptide of 

2 interest with a second peptide of interest, the 

3 fragments of the fluorescent protein 

4 functionally associate to promote fluorescence. 

5 

6 Thus, fluorescence will only be promoted when 

7 peptides of interest of bait and prey fusion 

8 proteins, having suitable linker lengths to allow 

9 the respective fluorescent protein fragments to 
10 associate, are used. 

11 

12 The provision of a peptide of interest linked to a 

13 fluorescent fragment via a range of linker lengths 

14 is advantageous over a single linker length as such 

15 a range maximises the chances of an interaction 

16 between peptides of interest being detected and 

17 minimises the chances that the fluorescent fragments 

18 cannot associate with each other due to 

19 stereochemical hindrance or that the linkers are too 

20 flexible (too long) and thus the fragments are not 

21 being brought together in space despite the proteins 

22 of interest interacting. 
23 

24 The provision of .fusion proteins wherein the fusion 

25 proteins comprise linkers of different lengths 

26 allows for the first time the provision of a general 

27 method which can be used to study the interaction of 

28 peptides of known and / or unknown structure and 

29 also with bulkier peptides of interest and small 

30 peptides of interest which interact with each other 

31 such that the fragments of fluorescent protein are 
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1 directed away from each other or peptides of unknown 

2 structure. 
3 

4 Preferably at least three different linker lengths 

5 are provided. More preferably at least four, even 

6 more preferably at least five different linker 

7 lengths are provided. 
8 

9 in an embodiment of the protein interaction system, 

10 the system may additionally comprise at least one 

11 bait fusion protein which is identical to one of the 

12 bait fusion proteins provided by the plurality of 

13 bait fusion proteins. 
14 

15 A plurality of prey fusion proteins may be provided. 

16 The linker portions of at least two prey fusion 

17 proteins may be of different lengths. For example 

18 two prey fusion proteins may be provided each 

19 comprising the same protein of interest and same 

20 fluorescent fragment, but provided with linkers of 

21 different lengths e.g. 10 amino acid residues and 20 

22 amino acids respectively. 
23 

24 In one embodiment the linker portions comprise in 

25 the range 5 to 60 amino acid residues, more 

26 preferably in the rana<= R f-.o 60 amino acid, yet more 

27 preferably in the range 20 to 60 amino acid 

28 residues. 



29 
30 



in a preferred embodiment at least one of the linker 



31 portions has at least 20 amino acids. 
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1 In particular embodiments of the invention a linker 

2 may comprise greater than 25 amino acids, for 

3 example greater than 30 amino acids, greater than 35 

4 amino acids, greater than 40 amino acids, greater 

5 than 50 amino acids or greater than 55 amino acids 

6 . in length. 
7 

8 Preferably, the linker comprises up to 60 amino 

9 acids . 
10 

11 More preferably the linker comprises up to 45 amino 

12 acids . 
13 

14 Preferably the linker is comprised, of substantially 

15 hydrophillic amino-acid residues. 
16 

17 More preferably at least one, preferably each of th< 

18 linkers comprises multiples of a pentapeptide 

19 sequence such as glycyl-glycyl-glycyl-glycyl-serine 

20 (SEQ ID NO: 1) - 
21 

22 Any fluorescent protein in which appropriate split 

23 sites can be formed and which the resulting 

24 fragments can associate with each other and cause 

25 fluorescence may be used in the invention. Examples 

26 of fluorescent proteins include red fluorescent 

27 protein and blue, yellow and cyan variants of GFP. 

28 Moreover, variants of GFP which have increased 

29 fluorescence may be utilised. However, in a 

30 preferred embodiment the fragments of fluorescent 

31 protein are fragments of green fluorescent protein, 

32 mutants or variants thereof. 
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1 

2 More preferably the fluorescent protein is a 

3 humanised form of a fluorescent protein, e.g. 

4 Enhanced Green Fluorescent Protein (EGFP) or a 

5 variant thereof. 
6 

7 In a humanised nucleotide sequence one or more of 

8 the codons in the sequence are altered such that for 

9 the amino acid being encoded, the codon used is that 

10 which most frequently appears in humans. This is 

11 advantageous as a humanised fluorescent protein 

12 construct e.g. (EGFP) has maximised expression 

13 levels and rate of f lurophore formation in mammalian 

14 cells. This makes detection of fluorescence, 

15 produced by fragments of fluorescent proteins 

16 (fluorogenic fragments) which functionally associate 

17 with each other, easier to determine. 
18 

19 in preferred embodiments, the fragments of 

20 fluorescent protein (fluorogenic fragments) are 

21 generatable through the introduction of a split 

22 point between the amino acids at positions 157 and 

23 158, or (in a second embodiment) between the amino 

24 acids at positions 172 and 173 of the humanised form 

25 of Green Fluorescent Protein (SEQ ID NO 2) shown 

26 below. 
27 

28 SEQ ID NO 2 - EGFP (Clontech Inc.) [Genebank 

29 Accession number gb:AAB02574 gi 1377912] : 

30 1 mvskgeelft gwpilveld gdvnghkfsv sgegegdaty 

31 41 gkltlkfict tgklpvpwpt lvttltygvq cfsrypdhmk 

32 81 qhdffksamp egyvqertif fkddgnyktr aevkfegdtl 
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1 121 vnrielkgid fkedgnilgh kleynynshn vyimadkqkn 

2 161 gikvnfkirh niedgsvqla dhyqqntpig dgpvllpdnh 

3 201 ylstqsalsk dpnekrdhmv llefvtaagi tlgmdelyk 
4 

5 The f luorogenic fragments generated by the 

6 introduction of a split point between the amino acid 

7 residues at positions 157 and 158, or between amino 

8 acid residues at positions 172 and 173, result in 

9 the production of hapto-EGFP 1/157 and hapto-EGFP 158/239 , 
10 or hapto-EGFP 1/172 and hapto-EGFP 173/23 V respectively. 
11 

12 Alternative split points are between residues 23/24, 

13 38/39, 50/51, 76/77, 89/90, 102/103, 116/117, 

14 132/133, 142/143, 190/191, 211/212 or 214/215 of 

15 EGFP. 
16 

17 Thus in preferred embodiments, the f luorogenic 

18 fragments are of amino acid residues 1 to 23, 1 to 

19 38, 1 to 50, 1 to 76, 1 to 89, 1 to 102, 1 to 116, 1 

20 to 132, 1 to 142, 1 to 157, 1 to 172, 1 to 190, 1 to 

21 211 or 1 to 214, and a respective complementary 

22 fragment 24 to 239, 39 to 239, 51 to 239, 77 to 239, 
90 to 239, 103 to 239, 117 to 239, 133 to 239, 143 
to 239, 158 to 239, 173 to 239, 191 to 239, 212 to 



23 
24 

25 239, or 215 to 239 of EGFP. 
27 



It can be envisaged that three or more fluorescent 
fragments may be provided by introducing two split 
points as discussed above into the fluorescent 
protein, each fragment being fused to a peptide of 

31 interest. On interaction of the peptides, the three 

32 or more fluorescent fragments are brought together 



28 
29 
30 



WO 2005/052588 



PCT/GB2004/004928 



12 

1 such that they can functionally associate and 

2 generate a fluorescent signal capable of being 

3 detected. 
4 

5 in another embodiment one or more of the three 

6 fluorescent fragments can be fused to a test agent 

7 such as a small molecule, such as a metal ion. In 

8 this manner, protein interactions which require the 

9 participation of additional test agents, such as 
10 small molecules, can be detected. 

11 

12 in an embodiment of the system wherein a plurality 

13 of prey fusion proteins are present, additionally or 

14 alternatively to prey proteins which comprise 

15 linkers of different lengths at least two of the 

16 second peptides of interest of the prey fusion 

17 proteins may comprise different amino acid 

18 sequences. 
19 

20 The prey fusion peptides may be provided as a 

21 library of different peptides of interest linked to 

22 a fragment of fluorescent protein which is 

23 complementary to the fluorescent fragment of the 

24 bait fusion protein. The library may be an 

25 expression library, a library of a range of 

26 mutations of * « r^r 4 " 5 ar other peptide 

27 libraries as known in the art. 
28 

29 The first peptide of interest may be linked to the N 

30 terminus of the first fragment of fluorescent 

31 protein or alternatively the first peptide may be 
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1 linked to the C terminus of the first fragment of 

2 fluorescent protein. 
3 

4 The second peptide of interest may be linked to the 

5 N terminus of the complementary fragment of 

6 fluorescent protein or alternatively the second 

7 peptide may be linked to the C terminus of the 

8 complementary fragment of fluorescent protein. 
9 

10 The peptides of interest' linked to the fragments of 

11 fluorescent protein can be small peptides of 

12 differing amino acid sequence, for example nonomers, 

13 comprising different amino acid compositions or the 

14 same overall composition, but with the amino acids 

15 present in a different order. Alternatively, the 

16 peptides may be full size proteins e.g. obtained 

17 from a cDNA library. Peptides may be produced 

18 synthetically or recombinantly using techniques 

19 which are widely available in the art. For peptides 

20 translated in the cell, naturally or induced, post- 
21 translational modification for example 

22 glycosylation, lipidation, phosphorylation of the 

23 peptides may occur, and these post translated 

24 products are still to be regarded as peptides. 
25 

26 in one embodiment, the protein interaction system is 

27 a cell based interaction system. 
28 

29 In such a cell based system, each cell preferably 

30 comprises one bait fusion protein of a single 

31 defined linker length. For example, if three bait 

32 fusion proteins are provided each of which has a 
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1 different linker length then a first cell will 

2 comprise a bait fusion protein of a first linker 

3 length, a second cell will comprise a bait fusion 

4 protein of a second linker length and a third cell 

5 will comprise a third bait fusion protein of a third 

6 linker length. 
7 

8 When the protein interaction system is provided as a 

9 cell based system, it may be produced using nucleic 

10 acid constructs which when expressed in live cells 

11 provide the components of the protein interaction 

12 system. 
13 

14 According to a second aspect of the present 

15 invention there is provided a library of nucleic 

16 acid constructs, each construct encoding 

17 (i) a first fragment of fluorescent protein 

18 capable of functional association with a 

19 complementary fragment of fluorescent protein 

20 such that on functional association of said first 

21 and complementary fragments fluorescence is 

22 enabled, 

23 (ii) a peptide of interest, and 

24 (iii) a linker portion interposed between the 

25 peptide and first fragment of fluorescent protein 

26 wherein the peptide -interest encoded by each 

27 nucleic acid construct is the same and the linker 

28 portion of each construct is of a different 

29 length to the linker of each other construct. 
30 

31 In preferred embodiments at least one linker portion 

32 comprises at least 20 amino acids. 
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1 

2 The inventors have determined an economical and 

3 relatively easy way of providing longer (for example 

4 greater than 20 amino acids) linkers in the bait and 

5 /or prey fusion proteins by providing linkers 

6 comprising multiples of a pentapeptide sequence such 

7 as glycyl-glycyl-glycyl-glycyl-serine. Such 

8 sequences provide advantageous flexibility 

9 properties and thus enable the linker region to be 

10 readily extended to provide a robust screening 

11 method. 
12 

13 According to a third aspect of the invention there 

14 is provided an expression vector comprising a 

15 plurality of the constructs as provided by the 

16 second aspect of the invention. 
17 

18 According to a fourth aspect of the invention there 

19 is provided an expression vector comprising at least 

20 one of the plurality of nucleic acid constructs 

21 wherein the at least one nucleic acid construct 

22 encodes a fusion protein having a linker of at least 

23 20 amino acids. 
24 

25 An expression vector may be introduced into a cell 

26 using any known techniques such as calcium phosphate 

27 precipitation, lipofection, electroporation and the 

28 like. 
29 

30 in embodiments of the invention more than one vector 

31 encoding a construct of the third or fourth aspect 

32 of the invention and / or a construct comprising a 
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1 complementary fragment of fluorescent protein may be 

2 introduced to a cell based system. 
3 

4 According to a fifth aspect of the present invention 

5 there is provided an assay method for monitoring 

6 peptide interaction comprising the steps of 
7 

8 providing the protein interaction system as 

9 provided in the first aspect of the invention, 
10 and 

11 

!2 detecting fluorescence produced by the 

13 interaction of first and second peptides of 

14 interest causing fragments of the fluorescent 

15 protein to functionally associate with each 

16 other. 
17 

18 In a particular embodiment the assay method is 

19 performed in vitro. 
20 

21 By providing fusion proteins of the protein 

22 interaction system in a cell based system or by 

23 mixing the fusion proteins of the first and second 



24 
25 
26 
27 
28 
29 
30 



protein of interest together in vitro the assay can 
be used to screen a protein fusion library to 
identify a second pept^- «f interest which binds to 
a first peptide of interest or vice versa. 

An embodiment of the assay may comprise the step of 
observing the subcellular location of the 



31 interaction of the first and second peptides of 



32 



interest in a cell. This step is advantageous as it 
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1 provides details of the location in the cell that 

2 the interaction is taking place, for example at the 

3 membrane, in the cytoplasm, or in the nucleus. 
4 

5 Any methods as known in the art may be used to 

6 determine the subcellular location of interaction, 

7 for example confocal scanning laser microscopy. 
8 

9 The assay method may further comprise the step of 

10 ' observing the level of fluorescence produced at a 

11 range of time points. 
12 

13 This step would allow determination of the 

14 subcellular dynamics of the peptide interactions 

15 visualised by fluorescence observations of living 

16 cells to enable spatio-temporal studies of peptide 

17 interactions throughout all parts of the cell cycle, 

18 for example such techniques would also allow the 

19 trafficking of interacting peptides, for example 

20 from the endoplasmic reticulum (ER) to the plasma 

21 membrane to be tracked. 
22 

23 The assay may comprise the step of determining the 

24 length of the linkers of those fusion proteins which 

25 allow the first fragment and complementary fragment 
o-f t-t><=> fluorescent protein to functionally 

27 complement each other and enable fluorescence to be 

28 detected on interaction of the first and second 

29 proteins of interest. 
30 

31 in such an embodiment the assay method may comprise 

32 the steps of 
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1 

2 providing the protein interaction system as 

3 provided in the first aspect of the invention, 
4 

5 detecting fluorescence produced by the 

6 interaction of the first and second peptides of 

7 interest causing fragments of the fluorescent 

8 protein to functionally associate with each 

9 other, 
' 10 

H selecting those cells in which fluorescence is 

12 detected, 

13 

14 clonally amplifying the cells in which 

15 fluorescence is detected, and 
16 

17 determining the length of the linkers- in said 

18 cells by DNA sequencing. 
19 

20 Determination of the linker length of those fusion 

21 proteins which interact with each other may be 

22 advantageous as the distribution of occurrence of 

23 linker lengths obtained from those cells in which 

24 fluorescence is observed should indicate a sharp 
2.5 cutoff at the lower limit of linker lengths 

26 reflecting the minimum linker length capable of 

27 spanning the separation of the fusion termini of th< 

28 interacting peptides. This in turn can be used to 

29 provide a value of the distance between the 

3 0 interacting peptides in Angstroms on the basis that 

31 each amino acid residue contributes 3.7A to the 
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1 length of each linker in an extended backbone 

2 conformation. 
3 

4 An embodiment of the assay may comprise the further 

5 step of isolating those fusion proteins which are 

6 determined as allowing the first fragment and 

7 complementary fragment of the fluorescent protein to 

8 functionally complement each other and enable 

9 fluorescence to be detected on interaction of the 
10 first and second peptides of interest. 

11 

12 Isolation may be achieved for example using a 

13 fluorescence activated cell sorting machine or laser 

14 microdissection. 
15 

16 in a particular embodiment of this assay laser 

17 excision of cell, amplification of the construct and 

18 sequencing may be used to allow the linker lengths 

19 of those bait and / or prey fusion proteins of 

20 interest which interact to cause fluorescence to be 

21 determined and thus indicate the minimum distance of 

22 the attachment points of the peptides of interest. 
23 

24 The isolated cells and fusion proteins may be 

25 subjected to further analysis, for example 

26 sequencing of the in+~-~~-i^s peptides. The 

27 sequenced peptides may then be compared with 

28 sequences (full length or partial) in a databank so 

29 as to identify or characterise the interacting 

30 peptide isolated from the cell. 
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1 The sequences of the interacting peptides may 

2 alternatively be inferred by cloning selected 

3 fluorescent cells and subjecting the cloned cells to 

4 e.g. PCR amplification and DNA sequencing. These 

5 sequences can then be cloned into expression vectors 

6 and the protein expressed and purified. The 

7 purified protein can be further studied or used for 

8 example in research. 
9 

10 The assay may be used to determine if test agents 

11 are capable of promoting or enhancing interaction of 

12 peptides or alternatively of preventing or 

13 inhibiting the interaction of peptides. 
14 

15 In such an embodiment the assay may comprise the 

16 steps of 
17 

18 providing the protein interaction system as 

19 provided in the first aspect of the invention, 
20 

21 detecting the level of fluorescence produced by 

22 the interaction of the first and second 

23 peptides of interest causing fragments of the 

24 fluorescent protein to functionally complement 

25 each other, 
26 

27 providing a putative interaction modulating 

28 agent, and 
29 

30 detecting the level of fluorescence produced in 

31 the presence of said putative modulating agent, 
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1 wherein detection of fluorescence in the 

2 absence of the putative modulating agent, but 

3 not in the presence of the putative modulating 

4 agent is indicative that the putative 

5 modulating agent prevents or is an inhibitor of 

6 peptide interaction and the detection of 

7 fluorescence in the presence of the putative 

8 modulating agent, but not in the absence of the 

9 putative modulating agent is indicative that 

10 the putative modulating agent promotes or 

11 enhances peptide interaction. 
12 

13 The detected fluorescence may be quantitatively 

14 determined such that fluorescence produced by 

15 different cells or under different conditions can be 

16 compared. 
17 

18 For example, in testing the effects of a putative 

19 modulating agent, any detected fluorescence may be 

20 measured in the absence and presence of the putative 

21 modulating agent wherein a reduction in fluorescence 

22 in the presence of said modulating agent compared to 

23 fluorescence in the absence of said candidate 

24 modulating agent is indicative of inhibition of 

25 complex formation by the modulating agent and an 

26 increase in fluorescence 2 = indicative of promotion 

27 or enhancement of complex formation by the 

28 modulating agent. 
29 

30 Modulation of the interaction between peptides may 

31 be a desirable outcome in the treatment of certain 

32 clinical conditions, or as a research tool to study 
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1 peptide to peptide interactions. For example, 

2 modulation of peptide to peptide interactions may 

3 facilitate the task of determining the steps of 

4 complex pathways by the provision of means to 

5 promote or inhibit a specific interaction, allowing 

6 the effects of other proteins to be studied in 

7 better detail. 
8 

9 Many peptide to peptide interactions require the 

10 participation of small molecules or peptides. Such 

11 a requirement can be determined by simply adding 

12 small molecules or peptides to a cell based system 

13 or to an in vitro mixture containing the fusion 

14 proteins of the interaction system and performing an 

15 assay as described above to determine if these small 

16 molecules or peptides modulate the interaction of 

17 the peptides of interest as determined by detection 

18 or measurement of an alteration in fluorescent 

19 signal . 
20 

21 Thus, embodiments of the assay of the present 

22 invention can be used to select compounds capable of 

23 triggering, stabilising or destablising peptide to 

24 peptide interactions. Embodiments of the assay 

25 method as described herein may be used to screen for 
oc ov^mpio, a receptor agonist, a receptor antagonist, 

27 protein inhibitors, or an inhibitor of protein to 

28 protein interactions. 
29 

30 As will be apparent, the assay of the present 

31 invention can be applied in a format appropriate for 

32 large scale screening, for example, combinatorial 
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4 

1 technologies can be employed to construct 

2 combinatorial libraries of small molecules or 

3 peptides to test as modulating agents . 
4 

5 Preferably, structural information on the peptide to 

6 peptide interaction to be modulated is obtained by 

7 testing different agents to determine if they are 

8 modulating agents. 
9 

10 For example, each of the interacting pair can be '• 

11 expressed and purified and then allowed to interact 

12 in suitable in vitro conditions. Optionally the 

13 interacting peptides can be stabilised by 

14 crosslinking or other techniques. The interacting 

15 complex can be studied using various biophysical 

16 techniques such as X-ray crystallography, NMR, or 

17 mass spectrometry, in addition, information 

18 concerning the interaction can be derived through 

19 mutagenesis experiments for example alanine 

20 scanning, or altering the charged amino acids or 

21 hydrophobic residues on the exposed surface of the 

22 bait or prey peptide being tested. 
23 

24 Based on the structural information obtained, 

25 structural relationship s bet-ween the interacting 

26 peptides as w^i 1 ="= v^*-™^" the modulating compound 

27 and the interacting peptides can be elucidated. 

28 Further, the three dimensional structure of the 

29 interacting moieties and / or that of the modulating 

30 compound can provide information to determine 

31 suitable lead compounds able to modulate 

32 interaction, which medicinal chemists can use to 
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1 design analog compounds having similar moieties and 

2 structures . 
3 

4 in a sixth aspect of the present invention there is 

5 provided novel compounds obtained using an assay of 

6 the invention. 
7 

8 Modulator compounds obtained according to the method 

9 of invention may be prepared as a pharmaceutical 
10 preparation or composition. 

11 

12 Such preparations will comprise the modulating 

13 compound and a suitable carrier, diluent or 

14 excipient. These preparations may be administered 

15 by a variety of routes, for example, oral, buccal, 

16 topical, intramuscular, intravenous, subcutaneous or 

17 the like. 
18 

19 According to an seventh aspect of the present 

20 invention there is provided a kit comprising nucleic 

21 acid constructs as provided in the second aspect of 

22 the invention and means to express the constructs. 
23 

24 The kit may further comprise candidate modulating 

25 agents, which promote, enhance, prevent or inhibit 

26 peptide interaction. 
27 

28 The kit may further comprise nucleic acids which 

29 encode at least one complementary fragment of 

30 fluorescent protein, at least one second peptide of 

31 interest and a second linker portion interposed 
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1 between the complementary fragment and the second 

2 peptide of interest. 
3 

4 in another embodiment the kit comprises a cell in 

5 which a vector comprising constructs of the second 

6 aspect of the invention can be expressed. 
7 

8 The kit may comprise a plurality of second peptides 

9 of interest of different amino acid sequences linked 
10 to a complementary fragment of fluorescent protein. 
11 

12 Additionally, the kit may include instructions for 

13 using the kit to practice the present invention. 

14 The instructions should be in writing in a tangible 

15 form or stored in an electronically retrievable 

16 form. 
17 

18 Preferred features of each aspect of the invention 

19 are as for each of the other aspects mutatis 

20 mutandis unless the context demands otherwise. 
21 

22 Unless otherwise defined, all technical and 

23 scientific terms used herein have the meaning 

24 commonly understood by a person who is skilled in 

25 the art in the field of the present invention. 
26 

27 Throughout the specification, unless the context 

28 demands otherwise, the terms 'comprise' or 

29 1 include', or variations such as 'comprises' or 

30 'comprising', 'includes' or 'including' will be 

31 understood to imply the inclusion of a stated 
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1 integer or group of integers, but not the exclusion 

2 of any other integer or group of integers. 
3 

4 Unless the context demands otherwise, the term 

5 peptide, polypeptide and protein are used 

6 interchangeably to refer to amino acids in which the 

7 amino acid residues are linked by covalent peptide 

8 bonds or alternatively (where post-translational 

9 processing has removed an internal segment) by 

10 covalent di-sulphide bonds, etc. The amino acid 

11 chains can be of any length and comprise at least 

12 two amino acids, they can include domains of 

13 proteins or full-length proteins . Unless otherwise 

14 stated the terms, peptide, polypeptide and protein 

15 also encompass various modified forms thereof, 

16 including but not limited to glycosylated forms, 

17 phosphorylated forms etc. 
18 

19 The term interaction or interacting as used herein 

20 means that two entities, for example, distinct 

21 peptides, domains of proteins or complete proteins, 

22 exhibit sufficient physical affinity to each other 

23 so as to bring the two interacting entities 

24 physically close to each other. An extreme case of 

25 interaction is the formation of a chemical bond that 

26 results in continual, stable proximity of the two 

27 entities. Interactions that are based solely on 

28 physical affinities, although usually more dynamic 

29 than chemically bonding interactions, can be equally 

30 effective at co-localising independent entities. 

31 Physical affinities include, but are not limited to, 

32 for example electrical charge differences, 
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1 hydrophobicity, hydrogen bonds, van der Waals force, 

2 ionic force, covalent linkages, and combinations 

3 thereof. The interacting entities may interact 

4 transiently or permanently. Interaction may be 

5 reversible or irreversible. In any event it is in 

6 contrast to and distinguishable from natural random 

7 movement of two entities. Examples of interactions 

8 include specific interactions between antigen and 

9 antibody, ligand and receptor etc. 
10 

11 Brief description of the figures 
12 

13 The present invention will now be described with 

14 reference to the following non-limiting examples and 

15 with reference to the figures, wherein: 
16 

17 Figure la is a ribbon diagram of EGFP; 
18 

19 Figure lb is an illustration of the split 

20 points and the related sequences surrounding 

21 these split points of EGFP; 
22 

23 Figure 2 is a representation of a hapto-EGFP 

. 24 with a range of linker lengths between the bait 

25 peptide and respective fluorogenic fragment and 

26 a plurality of pep*-^«s linked to a 

27 complementary fluorogenic fragment; 
28 

29 Figure 3 shows fluorescent images of Vero cells 

30 transiently cotransf ected with haptoEGFP 

31 expression constructs, (A) Cells cotransf ected 

32 with pNl57(5)zip and pzip(4)Cl58 in which a 
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1 functional leucine zipper mediates the 

2 association of haptoEGFPl-157 and haptoEGFPl58- 

3 238 to generate fluorescence, (B) Negative 

4 control co trans feet ion using pNl57(5) and 

5 p(4)C158 which lack sequences encoding the 

6 leucine zippers and as such fail to generate 

7 fluorescence, (D) Cells cotransf ected with 

8 pNl72(5)zip and pzip(4)Cl73 in which a 

9 functional leucine zipper mediated association 

10 of haptoEGFPl^l72 and haptoEGFPl73-238 occurs 

11 to generate fluorescence which is of greater 

12 intensity to that observed with the 157/158 

13 split point (E) Negative control 

14 cotransf ection using pNl72(5) and p(4)Cl73 

15 which lack sequences encoding the leucine 

16 zippers and as such fail to generate 

17 fluorescence, (C and F) Confocal images of 
13 cotransfected cells from (A) and (D) showing 

19 the intracellular localisation of fluorescence 

20 - Vero cells were cotransfected with plasmids 

21 encoding linkers ranging in length from 4 to 26 

22 amino acids and UV images were collected at 24 

23 hours post-transfection using identical 

24 exposure times, (G) pNl57(5)zip and 

25 pzip(4)Cl58 (H) pNl57 (15) zip and pzip (14) C158 
2* (i) pNl57 (25) zip ^-^"5 pzip (24)C158 (J) 

27 P Nl57(25)zip and pzip(4)Cl58 (K) pN157(5)zip 

28 and pzip(24)Cl58 (L) a negative untransfected 

29 control illustrates the background fluorescence 

30 level (Italicised figures in brackets indicate 

31 the length of the hydrophilic linker) ; and 
32 
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1 Figure 4 shows the importance of relative 

2 orientations of the haptoEGFP and binding 

3 proteins - figure 4A illustrates the case of 

4 associating membrane proteins where a Type I 

5 and Type II protein combine, both hapto. EGFP 

6 moieties must be on the same side of the 

7 membrane barrier for their combination, 

8 association of membrane proteins of the same 

9 type suffer from the same constraints (figure 

10 4b) wherein to obtain fluorescence fusion to 

11 the appropriate (cytoplasmic ) terminus of the. 

12 binding protein is to the same type of terminus 

13 on both haptoEGFPs (ie: N and N' or C and C 

14 for Type II and Type I respectively) 
15 

16 Functional association of fragments of fluorescent 

17 proteins, brought together by the interaction of 

18 peptides fused to the fragments to screen for 

19 peptide to peptide interactions requires that the 

20 fragments reliably functionally associate only after 

21 interaction of the fused peptides . Fluorescence may 

22 be measured by suitable method known to a person 

23 skilled in the art, for example, fluorescence 

24 spectrometry, lunminescence spectrometry, 

25 fluorescence activated cell analysis, fluorescence 

26 activated cell sorfcincr. ant-oraated microscopy or 

27 automated imaging. 
28 

29 Reliable functional association has to date not been 

30 achieved due to the possibility of steric hindrance 

31 and steric constraints on the functional association 

32 of haptoFPs when bulky proteins are associated to 



WO 2005/052588 



PCT/GB2004/004928 



30 

1 the fluorescent protein fragments due to too short 

2 linkers being interposed between the peptide of the 

3 interest and the fragment of fluorescent protein or 

4 too much flexibility due to too long a linker being 

5 interposed between the same. 
6 

7 The inventors have determined an economical and 

8 reliable method to provide a range of bait fusion 

9 proteins comprising a linker region of varying 

10 length' and thus provide' a robust screening 

11 interaction system and method. 
12 

13 This minimises the problems of steric hindrance, as 

14 a peptide of interest is provided with both 

15 considerable flexibility due to the presence of long 

16 linkers, but also ensures that short linkers are 

17 present such that the fragments of fluorescent 

18 protein are brought into close proximity with each 

19 other. Thus the chance of a false negative result 

20 being obtained, i.e. finding that the peptides of 

21 study do not bind when in fact they do, is reduced. 
22 

23 A general description of the principle of the 

24 invention is shown in figure 2 using haptoEGFPs as 

25 the fluorescent fragments. 
26 

27 As shown in figure 2 protein to protein interaction 

28 searches can be conducted by library interrogation. 

29 The two peptides being tested for interaction are 

30 designated bait and 'prey' W W . Two libraries are 

31 generated (I and II), one series of constructs (here 

32 designated T....Z, library I, >10,000 members) encodes 
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1 a hapto-EGFP followed by a DNA sequence encoding a 

2 60 residue linker attached to the 5 '-end of a cDNA 

3 library, which contains the gene encoding the 

4 'prey', "W" here. The second series of constructs 

5 (a.-e here, library II, <20 members) encodes the 

6 complementary hapto-EGFP followed by a degenerate 

7 linker DNA sequence and the 'bait' gene. All arrows 

8 indicate the direction of the polypeptide backbone 

9 (N->C) . 
10 

11 A. 'Prey' identification: co-transfection with the 

12 'prey' library (I) and construct 'e' (long linker - 

13 preferably 60 amino acid residues) from the 'bait' 

14 library (II) generates fluorescent cells when the 

15 recipient cell receives a vector from library (I) 

16 bearing the 'W gene (in this case) and a second 

17 vector bearing the 'e' bait construct. Clonal 

18 expansion of these fluorescent cells allows 

19 identification of gene 'W* . 
20 

21 B. Proximity measurement: The clone (s) from step A 

22 are co-transf ected with the 'bait' library (II). In 

23 this case cells showing fluorescence synthesise 

24 interacting proteins with a sufficiently long linker 

25 to allow productive complementary hapto-GFP 

.26 interaction, ('d' or 1- this case), as shown to 

27 the left of the diagram. The hollow arrows in the 

28 right hand part of the diagram are intended to 

29 indicate that the interaction of the gene products 

30 with these two constructs generates fluorescence, 

31 while other interactions between the product of gene 

32 'W and the bait protein do not give rise to 
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1 fluorescent cells due to insufficient length of 

2 linker. 
3 

4 Generation of fluorescent fragments 
5 

6 Fluorescent fragments may be provided by any means 

7 known in the art. A first fragment of fluorescent 

8 protein may be an N terminal fragment of fluorescent 

9 protein, e.g. GFP, comprising a substantially 

10 continuous stretch of amino acids from amino acid 

11 number 1 to amino acid X of fluorescent protein and 

12 a second fragment may be a substantially continuous 

13 stretch of amino acids from X+l to around the C 

14 terminal end of the fluorescent protein (e.g. amino 

15 acid 23 8 of GFP), wherein the bond between residue X 

16 and X+l typically is located in a hydrophilic loop 

17 region of the fluorescent protein. Should greater 

18 than two fragments of fluorescent protein require to 

19 be generated for use in assay methods where three or 

20 more fragments of fluorescent protein are linked to 

21 proteins of interest then a N terminal fragment may 

22 comprise a substantially continuous stretch of amino 

23 acids from amino acid number 1 to amino acid X of 

24 fluorescent protein, a second fragment can be 

25 considered as a substantially continuous stretch of 

26 amino acids from X+l to residue Y and a third 

27 fragment may be provided by a substantially 

28 continuous stretch of amino acids from Y+l to around 

29 the C terminal end (e.g. amino acid 238) of 

30 fluorescent protein. In such an example the bonds 

31 between X and X+l and Y and Y+l will be located in 

32 hydrophilic loop regions of fluorescent protein. 
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I Generation of linkers 
2 

3 As shown in figure 2, multiple bait fusion peptides 

4 may be created with linkers of differing lengths. 
5 

6 To enable economical extension of a linker, to 

7 provide linkers of differing lengths, each linker 

8 may be created using overlapping oligonucleotides 

9 encoding repeating (GGGGS) x units wherein the linker 
10 oligonucleotide is engineered to carry a unique 

II restriction site, for example unique Sac I and BamHI 

12 restriction sites, present in a core expression 

13 vector, for example pN EGFP (Sac) zip and p zip (Bam) C EGFP 

14 (Sac I for the hexapapeptide and BarriH I for the 

15 tetrapeptide in example 2 ) . 
16 

17 This allows the insertion of synthetic 

18 oligonucleotides encoding further flexible 

19 hydrophilic linker sequences of the form (GGGGS) n 

20 with the appropriate 5' and 3' sticky ends to 

21 distance the binding peptides (for example leucine 

22 zippers - see example 2) away from the signalling 

23 haptoEGFPs . 
24 

25 Once prepared the constructs may be sequenced before 

26 transfection to confirm correct orientation of the 

27 insert. 
28 

29 Further as illustrated in figure 2, a library of 

30 prey fusion peptides may be provided wherein the 

31 linkers of the prey fusion peptides are of the same 

32 length, but different second peptides of interest 
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1 are fused to the linker region fused to the , 

2 complementary fragment of fluorescent protein. 
3 

4 in general to prepare a library of fusion proteins 

5 of unknown library sequences, the sequence encoding 

6 the hapto-EGFP is fused to the 5' end of the peptide 

7 library due to the presence of downstream stop 

8 codons in the cDNA. 
9 

10 If the gene sequence encoding the protein is 

11 unknown, constructs are required to be generated for 

12 all three reading frames to ensure that one is in 

13 the correct reading frame. 
14 

15 The cDNA sequences should be obtained from a source 

16 which permits directional cloning into restriction 

17 sites which are extremely rare in mammalian DNA. 

18 Suitable sequences may be found in the Large-Insert 

19 cDNA library (Clontech) . 
20 

21 In particular embodiments a core panning vector may 

22 be engineered from existing plasmids to contain a 

23 CMV promoter, an initiation codon, sequences 

24 encoding a hapto-EGFP, an intervening linker, an Sfi 

25 IA site and an Sfi IB site, a stop codon and an SV40 
~e — lyadenylaticsn signal Two additional screening 

27 vectors may be generated to include one and two 

28 extra nucleotides between the linker and the Sfi IA 

29 site to correct the reading frame. cDNA fragments, 

30 flanked with Sfi IA and Sfi IB sites obtained from 

31 the library are cloned downstream of the optimised 

32 hapto-EGFP linker constructs. The hapto-EGFP library 
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1 is then transfected into suitable cells, for example 

2 CHO cells and a mixed population of cells selected 

3 using G418 and passaged to conf luency 
4 

5 Where interaction between the peptides being 

6 screened occurs and the linkers allow association of 

7 the fragments of fluorescent protein, fluorescence 

8 is generated. 
9 

" 10 Any cells which fluoresce may then be isolated by 

11 fluorescent laser microdissection and single cell 

12 RT-PCR performed to identify mRNA which encodes 

13 peptides which interact with the cytoplasmic tails 

14 of the receptor molecules. 
15 

16 Example 1 - Generation of G FP Fragments 
17 

18 The GFP fragments of the interaction system capable 

19 of functional association were generated by split 

20 points at various points along the 239 residue 

21 length of the GFP protein, resulting in the 

22 generation of new C and N' termini which, in three 

23 dimensions, are located at the top and at the base 

24 of the beta-can structure. 
25 

26 Split points ~~r« based on a structure 

27 driven approach between hydrophilic residues. 
28 

29 As shown in Figure 1 the beta-can topology of EGFP 

30 is formed by the eleven strands of the beta 

31 structure. This structure is characterised by 

32 forming three instances of a tripartite antiparallel 
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1 
2 
3 
4 
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6 
7 
8 
9 
10 
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12 
13 
14 
15 
16 



sheet motif joined edge to edge around the periphery 
of the 'can', with the remaining two beta strands 
completing the cylindrical structure. The most 
successful split points obtained to date occur in 
the third tripartite motif between hydrophilic 
residues allowing adjacent hydrophobic side chains 
to promote refolding of the haptoGFPs. 

As shown in the non exhaustive list of Table 1 a 
number of split points were identified using the 
above approach. It would appear that each split 
point in Table 1 is simply one example of a range of 
potentially useful split points, the range being 
shown in parentheses of Table 1. 



Table 1 



Split point 
Number 


Residue 
position in 
EGFP 


Possible 
range 


1 


23/24 


(23-25) 


2 


38/39 


(36-41) 


3 


50/51 


(48-54) 


4 


76/77 


(75-91) 


5 


89/90 


(75-90) 


6 


102/103 


(101-103) 


7 


116/117 


(115-118) 


8 


132/133 


(129-143) 


9 


142/143 


(129-143) 


10 


157/158 


(155-160) 


11 


172/173 


(171-175) 


12 


190/191 


(187-199) 
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13 


211/212 


(207-218) 


14 


214/215 


(207-218) 



1 

2 

3 To extend the versatility of the hapto-EGFP method, 

4 constructs were created where instead of using C 

5 and N 1 for the attachment of heterologus proteins, 

6 the endogenous termini, N or C, together with one of 

7 the new N 1 or C termini were used (C and N' are 

8 those N and C termini created on splitting the GFP 

9 protein into fragments, C is thus equivalent to the 

10 new C terminal produced on the first fragment and N' 

11 is equivalent to the new N terminal produced on the 

12 complementary fragment) . Using this technique the 

13 bait and prey peptides were added such that they 

14 were orientated to the associated fluorogenic 

15 fragments in the same direction as each other, for 

16 example both peptides of interest were attached to 

17 the bottom of the p-can structure of GFP or in the 

18 opposite direction, for example the bait peptide was 

19 attached to the bottom of the p-can structure of 

20 GFP, while the prey protein was attached to the top 

21 of the p-can structure of GFP. As shown in figures 

22 4 A & B, as peptides interact with each other in a 

23 particular orientation, then the direction of the 

24 linkage of the peptide to the N, N' , C or C' end of 

25 the fluorogenic fragment may be important in certain 

26 circumstances so as to allow the fluorescent protein 

27 fragments to functionally interact following 

28 interaction of the peptides. 
29 

30 
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I Example 2 
2 

3 to determine the effect of varying the length of the 

4 intervening hydrophilic linkers interposed between 

5 complementary fragments of fluorescent protein and 

6 leucine zipper proteins known to bind to each other 

7 the linkers were empirically increased in length in 

8 decapeptide units using the general method detailed 

9 above to modify linkers of both pN 157 (6)zip and 

10 pzip(4)C 158 to increase the linker by 10, 20, 30 and 

II 40 residues by the insertion of complementary 

12 oligonucleotides with Sac I and BamH I sites to 

13 generate in the case of the N 157 (6)zip chimera, to 

14 16, 26, 36 and 46 and, in the case of the 

15 complementary zip(4)C 158 chimera, to 14, 24, 34 and 

16 44 residues. 
17 

18 The results of this study are shown in figure 3. 
19 

20 No significant differences in the levels of 

21 fluorescence were observed when the hydrophilic 

22 spacers were lengthened by up to 26 and 24 amino 

23 acids respectively. However, the signal increased 

24 when spacers of 36 and 34 separated the leucine 

25 zipper and the haptoEGFP moieties, whereas the 

26 signal decreased when linkers comprised of 46 and < 

27 amino acids were introduced. 
28 

29 Example 3 
30 

31 utilisation of MV H as a model homo-oligomerising 

32 transmembrane glycoprotein 
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2 in order to demonstrate that this approach can be 

3 used for a wider range of applications than current 

4 reporter systems the membrane glycoproteins of 

5 Measles Virus (MV) were examined. 
6 

7 Measles virus (MV) infection is mediated by a 

8 complex of two viral envelope proteins, 

9 haemagglutinin (H) glycoprotein and fusion (F) 

10 glycoprotein that bind to each other and then 

11 complex with surface receptors to aid the fusion of 

12 the virus with the plasma membrane. The H 

13 glycoprotein is dimerised in the endoplasmic 

14 reticulum and is thought to exist on the cell 

15 surface as a tetramer (dimer of dimers) . The fusion 

16 (F) glycoprotein, is synthesised as an inactive 

17 precursor (F 0 ) which is a highly conserved type I 

18 transmembrane glycoprotein of about 60kDa, which is 

19 cleaved by furin in the trans-golgi to yield the 

20 41kDa (fx) and the 18kDa (f 2 ) disulphide-linked 

21 activated F-protein. Infection of the measles virus 

22 is dependent on the interaction of the F/H complex 

23 with cell surface receptors. 
24 

25 Two constructs, which expressed N157(16)MV-H and 

26 C158 (14)MV-H, were initially generated in order to 

27 investigate homodimerisation of a type II membrane 

28 glycoprotein of unknown structure. The linker 

29 regions of these constructs were generated using 

30 overlapping oligonucleotides which contain Sfi IA 

31 and Sfi IB restriction sites were introduced into 

32 pN 1/157 (16)zip and pC 158/239 (14) zip constructs. These 
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1 chimeras differ from those generated from the 

2 leucine zippers in that the first has H fused to the 

3 C terminus, while the second employs the endogenous 

4 C terminus for fusion. Expression of the chimeric 

5 proteins was detected by immunoblotting cell lysates 

6 using peptide antiserum raised against EGFP (results 

7 not shown) . This demonstrated that the haptoEGFP 

8 tagged H glycoproteins were stably expressed in the 

9 transfected cells. Furthermore, the electrophoretic 

10 mobility of the chimeric proteins suggested that* 

11 they were correctly glycosylated. Fluorescence was 

12 readily detected in living cells and all of the 

13 necessary controls demonstrated that the association 

14 of the haptoEGFPs was specifically driven by the 

15 dimerisation of the H glycoproteins. Fluorescence 

16 was absent from the nucleus but was clearly 

17 demonstrable from the ER through the Golgi to the 

18 plasma membrane of the cells. 
19 

20 It is clear that this methodology could be used to 

* 21 identify further, membrane receptor proteins which 

22 interact with the H protein as could cytoplasmic 

23 proteins which interact with known WJ receptors and 

24 which may therefore initiate downstream signalling 

25 events. 
oc 

27 Example 4 
28 

29 In order to ascertain that the haptoEGFP tagged 

30 glycoproteins were capable of forming a biologically 

31 active complex at the cell membrane cells were 

32 transfected with constructs expressing a number of 
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1 different H and F chimeras. Firstly the bioactivity 

2 of the H chimeras was investigated by co- 

3 transfection with a plasmid expressing the 

4 unmodified F glycoprotein. Initially cell-to-cell 

5 fusion was readily detected 2 d.p.t. in cells 

6 expressing N157 (16) MV-H, C158 (14)MV-H, and F. 
7 

8 Multi-nucleated syncytia comprised of more that 50 

9 cells were obtained which were easily discernible by 
10 phase- contrast microscopy. 

11 

12 Fluorescence was detected by vital confocal laser 

13 microscopy in all syncytia, their size was 

14 comparable to that obtained by co-expression of 

15 unmodified MV F and H. 
16 

17 By three days post-transf ection, cell-to-cell fusion 

18 was detected over large areas of the monolayer and 

19 many syncytia comprised of over 200 individual 

20 cells. Confocal scanning laser microscopy was used 

21 to determine whether localised fluorescence was 

22 present within the syncytia and series of images 

23 were collected. Composite images were generated and 

24 fluorescence localization was examined in the x/z 

25 and y/z planes . Fl 1 1 or escenc e_was detected in the 

26 perinuclear re<-H o-na *nri =>icro Ixl a honeycomb lattice 

27 which is consistent with the presence of the 

28 glycoprotein in the ER and Golgi. 
29 

30 When the plasma membrane was examined in x/z and y/z 

31 it was difficult detect a discrete line of 

32 fluorescence in single sections. However, small 1-5 
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1 ]m vesicles with fluorescent membranes were 

2 frequently detected at the cell surface. These 

3 vesicles are very reminiscent of budding virions and 

4 are approximately 10 times larger than MV virions. 
5 

6 These co- trans f ected cells were fixed in order to 

7 examine the intracellular localisation of 

8 fluorescence within syncytia at higher 

9 magnifications. In the fixed cells it was also 

10 clear that the fluorescence was present in the ER 

11 and Golgi as expected. However, areas of localised 

12 fluorescence were also detected at the periphery of 

13 the syncytia where the fused cells came into contact 

14 with the surrounding cells > suggesting that the H 

15 glycoprotein dimers are not evenly distributed on 

16 the plasma membrane and these accumulations could be 

17 sites of fusion pore formation where the H 

18 glycoproteins are in close contact with the cellular 

19 receptor, in this case CD46. 
20 

21 The extracellular localisation of the H dimers was 

22 also examined by indirect immunofluorescence using 

23 an anti-H MAb on unpermeabilised cells. This 

24 vital immunostaining indicated that a significant 

25 percentage of the dimeric H chimera had been 

26 correctly processed and trafficked to the cell 

27 membrane where, in view of the size of the syncytia, 

28 it was clearly functional. Fluorimetery was used to 

29 determine if the fluorescence could be detected and 

30 quantified. In cells transf ected for defined 

31 periods of time it was found that syncytia formed. 

32 Fluorescent signals were detected which were 
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1 equivalent to those obtained in pNl57(6)zip and 

2 pzip(4)C158 co-transfected cells. No signals were 

3 obtained when the construct which expressed 

4 C158(14)MV-H was replaced by one encoding 

5 zip(14)Cl58 indicating that the specific association 

6 of the H glycoproteins was driving the haptoEGFP 

7 moieties into close enough proximity to enable the 

8 generation of the fluorophore. 
9 

10 Although the invention has been particularly shown 

11 and described with reference to particular examples, 

12 it will be understood by those skilled in the art 

13 that various changes in the form and details may be 

14 made therein without departing from the scope of the 

15 present invention. 
16 

17 
18 



